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A quantum copying machine producing two (in general non-identical) copies of an ar- 
bitrary input state of a two-dimensional Hilbert space (qubit) is studied using a quality 
measure based on distinguishability of states, rather than fidelity. The problem of produc- 
ing optimal copies is investigated with the help of a Bloch sphere representation, and shown 
to have a well-defined solution, including cases in which the two copies have unequal quality, 
OO . or the quality depends upon the input state (is "anisotropic" in Bloch sphere language), or 

0^ ' both. A simple quantum circuit yields the optimal copying machine. With a suitable choice 

of parameters it becomes an optimal eavesdropping machine for some versions of quantum 
cryptography, or reproduces the Buzek and Hillery result for isotropic copies. 
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The no-cloning theorem is one of the most important features which distinguishes quantum from 
^■f^ I classical information theories. There is no difficulty making an arbitrary number of copies of any type of 

. information which arrives over a classical channel. However, to copy or "clone" the information which arrives 

' over a quantum channel is not possible without producing errors if it is encoded in terms of nonorthogonal 

, quantum states. This fact is the basis for various schemes of quantum cryptography since the attempt 

CD ■ of an eavesdropper to tap into a quantum channel and duplicate the information in it will result in errors 

detectable by the legitimate users. 

Even in cases in which perfect copies are excluded because of the no-cloning theorem, it is possible to 
produce imperfect copies which are better than random noise. An optimal copying machine is one in which, 
by means of a suitable unitary transformation, a certain number of copies of the original information are 
produced with the smallest number of errors, or minimal amount of noise. Optimality in this sense will 
obviously depend upon the ensemble of input states, the number of copies, and the measure employed for 
^ ' noise or error rate. Obtaining quantitative estimates of what is possible for optimal copying is a significant 

problem in quantum information theory, and it is to this problem that the present paper is addressed. 

A number of previous studies have used various input ensembles and measures of quality or of noise. Buzek 
and Hillery discussed an "isotropic" , or "universal" , copying machine Q , which produces two identical copies 
. from one qubit (two-state system), with copy quality independent of the input state. This copying machine 

5h ' was later proved by Bruf5 et al. g] to be optimal among all possible isotropic machines, if the measure of 

quality is the fidelity between the input and the output: that is, the probability that the output will be 
measured to be in the same state as the input. Gisin and Massar [|6| considered a case in which M identical 
copies are generated from N identical qubits {M > N), and found that the average fidelity reaches its 
maximum when the machine is isotropic. Bruf5, Ekert, and Macchiavello Q found an interesting connection 
between the optimal isotropic copying process and quantum state estimation by measurement. Later, the 
optimal M-io-N problem was generalized by Werner Q to systems of arbitrary dimension. Various quantum 
circuits implementing some of the isotropic copying machines mentioned above are presented in |^,|l^. Brufiet 
al. also considered an input ensemble of two non-orthogonal states, and found that the optimal copying 
process requires no ancillary qubits, unlike the case of isotropic copying. 

In contrast to the work just discussed, we will use a quality measure based upon the distinguishability of 
the output states, rather than the fidelity, and will consider non-identical and anisotropic copies for which 
the copying quality can depend upon the input state. This is motivated by a study of eavesdropping in 
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quantum cryptography [Q. Eavesdropping is, in essence, a matter of producing imperfect "copies" of the 
input signal in some channel in a way which enables the eavesdropper to distinguish the different states in 
the original information, while at the same time perturbing it as little as possible. Thus distinguishability is 
more important to the eavesdropper than whether his copies resemble the original states. In addition, the 
amount of information obtained by the legitimate receiver and by the eavesdropper are generally different, 
and can depend on the input states. Since the problem of optimizing a copy machine for completely general 
inputs and outputs is quite difficult, we restrict ourselves to the case of only two copies of one qubit. Even 
with this restriction, the mathematical structure is nontrivial. Recently Cerf fl^ explored this problem by 
investigating a family of "Pauli cloning machines." He derived a no-cloning inequality for isotropic copies, 
and established a bound for the quantum capacity of the corresponding "Pauli channel." 

We introduce the background material needed for later sections in Sec. ||, including a formalism for the 
general interaction of one qubit with another quantum system, a set of superoperators, and the Bloch sphere 
representation. In Sec. Ill, the quality function of an output qubit is defined, and it is expressed using the 
Bloch sphere representation; this is the measure we employ to define optimal copying. Given the quality 
of one copy, the best quality for the other is determined by a map G, which is derived in Sec. IV for a 
special case. However, this special case is as good or better than any other copying scheme, as shown in 
Sec. Section ^ discusses a number of properties of the map G, the conditions under which a pair of 
copies produced are actually optimal, and particular examples that are useful in revealing relations between 
the copying qualities. In Sec. VII we introduce two quantum circuits implementing the optimal copying 
process, and show how they can be turned into an eavesdropping machine or a univ ersal copying machine. 
Finally, the results are summarized and some open questions are noted in Sec. VIII . Various mathematical 
details are placed in Appendices A-D. 



II. GENERAL FORMULATION OF THE COPYING PROBLEM 



We define the copying problem in the following way. The qubit which is to be copied is a state in a 
two-dimensional Hilbert space A with orthogonal basis |0) and The copy machine corresponds to a 
unitary transformation U on 

n^A<»£ ^B^£ =B^C(»V, (2.1) 

where we suppose that the copies lie in the two dimensional spaces B and C after the action of U, and the 
space £ = C i^V can have an arbitrarily large but finite (even) dimension. The identical spaces A and B are 
denoted differently in order to distinguish the input space from the output. In particular, 
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Ui\j)^\e))=J2\l)^\ei), (2.2) 
where |e) is a normalized state of £, and therefore the states \e-j) of £ satisfy 



el|ef)=<5,fc. (2.3) 



There is no loss of generality in assuming a pure state |e) rather than a density matrix for £, since a density 
matrix can always be thought of as arising from a partial trace on a space of higher dimension, and the 
dimension of £ is arbitrary. 

With |e) fixed, define the linear map V : A-^B ® £ through 
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V\a) = U{\a) ® |e)) = Y.^ai\a)) <E> \ei), (2.4) 

(=0 



where co is the identity, and ci, (T2, and 173 are the usual Pauli matrices ax-, (Jy, and Cz, which in (2.4) should 
be thought of as maps from A to B, assuming some basis |0), |1) for the latter; here |0) and |1) correspond 
to Sz eq ual t o -t-1/2 and —1/2, respectively, and (|0) -I- |1))/a/2 to Sx = +1/2 in the usual spin half notation. 



Thus if (2A) is written out explicitly, it takes the form: 
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l-|0) = |0)|eo) + |l)|ei)+i|l)|e2) + |0)|e3), 

I^|l) = |l)|eo) + |0)|ei)-.|0)|e2)-|l)|e3), (2.5) 

where, following the usual convention, we omit ^ if this causes no confusion. The four |e;) are vectors in £ 
which are linear combinations of the je'^), and whose inner products form a 4 x 4 positive (i.e., non-negative 
eigenvalues) Hermitian matrix E: 

Elm = {ei\em) = E^i. (2.6) 

Note that given a positiv e H ermitian matrix E, it is always possible to find a set of four vectors |e;) such 
that Elm takes th e fo rm (2.6), and this set is unique up to an arbitrary unitary transformation on £. 

The condition (2.3) implied by the unitarity of U is equivalent to the requirement that V be an isomctry, 
or that 

FV-^£;,„,t7,a™ = ao, (2.7) 



Irn 



the identity on and thus to 



= (2-8) 



(=0 



Re{EQq)^lTn{Eq,q„), (2.9) 



where Re and Im refer to real and imaginary parts, and ( |2.9| ) employs a notation which we will use again 
and again: 

(q, q\ q") - (1, 2, 3) or (2, 3, 1) or (3, 1, 2). (2.10) 
That is, the triple {q, q', q") is an even permutation of the integers 1, 2, and 3. Thus, for example, Re[£'o2] = 

llJl[E3i]. 

Given V, the corresponding superoperator V which maps operators on A to operators on Ti. takes the form 

(Jjnak ^ |ej)(efe|. (2-11) 

jk 

In particular, for / = 0, 1, 2, 3, 

^i'^i) ^^L{jk\lm)am \ej){ek\ ^^(Jm(S)Fim, (2.12) 

jkm m 

where the Fim are a collection of 16 Hermitian operators on £ defined by 

i^im =^i(iA:;/m)|e,)(efc|, (2.13) 

jk 

and 

L{jk; Im) = ^TT[aj(Tiak<Jm] (2-14) 

can be thought of as a 16 x 16 matrix, with rows labeled by jk and columns by Im. Its properties are 
discussed in App. In particular, is its own inverse. When j = k oi when I — to, L{jk; Im) vanishes 
unless the other pair is also equal: 

L{jk; II) = <5,fcA(j, Z), L(jj, Im) = 5z„,A(j, Z), (2.15) 

where A(j, ^) is a 4 x 4 real matrix, (|A8|), whose square is 4 times the identity. 

The superoperators Vg, Vc, and , which play a central role in the following discussion, map operators 
on A to operators on B, C, and respectively and are defined using partial traces of Y: 



3 



Vein) = Tre^p [!/(«)] = TrviVsi^i)] 



The isometry condition (2.7) implies that V preserves the trace, 

Tr„[l/(K)] =Tr^M, 



(2.16) 



(2.17) 



and this property is inherited by each of the superoperators Vis, Vc, and Vg, with the trace over Ti. in ( 2.17 ) 
replaced by a trace over B, C, and £, respectively. (Henceforth we shall not always indicate explicitly the 
space over which a trace is to be taken if it is obvious from the context.) In addition, it is obvious from 
( ^.ll|) that V maps Hermitian operators to Hermitian operators, and this property is also inherited by Vg, 
Vc, and Vg. 

It is convenient to write 



(2.18) 



where 



Bin 



^L{lm;jk)Ejk 



(2.19) 



is a real 4x4 matrix of the form: 



B 



/I 




Vo 



5^ 



Si 



B 



"3 



(2.20) 



where columns and rows are in the order 0,1,2,3, and B is a, real 3x3 matrix. The form of the first 



(left-most) column of B is equivalent to the isometry condition (2.7), that is, to ( |2.8| ) plus (2.£), as can be 
sho wn b y inserting in ( 2.19 ) the matrix eleme nts fo r the L giv en in App. It implies that the superoperator 
in ( 2.1^ ) preserves the trace. The inverse of ( |2.19D , see (AlC), is 



(2.21) 



im 



The vector d — {Si , $2 ,6l^) and the matrix B in ( p.2C| ) can be given a geometrical interpretation using 
the well-known Bloch sphere representation 



(2.22) 



of a density matrix p in a two-dimensional Hilbert space, where r = (ri, r2, r^) is any real v ector of le ngth 
less than 1 for a mixed state, or equal to 1 for a pure state, and cr = (ci, (72, tra). Then ( 2.18 ) and ( 2.20| ) tell 
us that 



^{(70 + s ■ cr), 



where [Ol 



r ■ B 



B^ 



(2.23) 



(2.24) 



and B^ is the transpose of B. In particular, if ( 2.24 ) is applied to the collection of pure states, \r\ = 1, 
which form the unit Bloch sphere of A, the resulting collection of s vectors forms an ellipsoid in the Bloch 
sphere of B, which we shall call the B ellipsoid, or simply ellipsoid, with center at . 
The polar decomposition of B takes the form 



B = P • B' 



(2.25) 



4 



where P is a proper rotation, a real orthogonal 3x3 matrix with determinant 1, and is a real symmetric 
matrix. By diagonalizing B" ^ one can always write B in the form 



B = P' B'^ 



(2.26) 



where P' and P" are proper rotations, and B"^ is a diagonal matrix with (real) eigenvalues 61,62,63. The 
rotations P' and P" correspond to unitary transformations on A and B, respectively. Consequently, by 
adopting suitable orthonormal bases for A and we can, assuming U to be given, always arrange to have 
the matrix B in the form 



B = 



5f 


"2 


"3 


61 











62 











63 



(2.27) 



parameterized by 5^ — (<5f , (5^, (^^) and b ~ ( 61, 62 , 63). (Note that in the process of diagonalizing the 
general form ( p. 20 ) to obtain the diagonal form (2.27), the values of the 5^ will, in general, change.) 

Two vectors corresponding to an orthonormal basis or mode for A are represented by opposite poles 
rh = (mi,m2,Tn,3) and —rh of the Bloch sphere of A, where we use a hat to indicate a unit vector. Under 



(2.24) these map to opposite points of the B ellipsoid; that is, the line connecting them passes through the 
center 8^ of the ellipsoid. The three principal axes of the B ellipsoid correspond to three principal modes of A, 
represented in the Bloch sphere of ^ by mutually perpendicular unit vectors (and their negations) Mi, M2, 
and Ms- If bases are chosen such that B has the form ( ^.27 ), the mode vectors Mj are along the Cartesian 



axes, and correspond, in the language of spin half, to a value of +1/2 for S'^^, 5^, and Sz, respectively. (It is, 
of course, important to distinguish "orthogonal" with reference to the complex Hilbert spaces A or B, from 
"perpendicular" as it refers to real three-dimensional vectors in the Bloch sphere representation!) Also, \bj\ 
is the length of the j'th principal semi-axis of the ellipsoid. 

Because P' and P" are proper rotations (each has determinant 1), it is not always possible to make all 
the bj, j — 1, 2, 3, in ( p. 27 ) positive (non- negative) . If an even number of the hj are negative, then P' and 
P" can be chosen so they are all positive, but if an odd number are negative, the best one can do is to have 
two positive and one negative. Which of these situations occurs depends upon J7, and turns out to be of 
some significance for optimal copying. 

Since C is als o a t wo-dimensional space, we can use a Bloch sphere representation for Vq and write the 
counterpart of ( 2.23| ) as 



Vc(p)=p" = i(^o+t-T), 



(2.28) 



where 



t = 5^ + r • C = 5^ 



The counterpart of (2.20) is a 4 x 4 matrix 



C 




(2.29) 



(2.30) 



where C is a 3 x 3 matrix which can be diagonalized by the same strategy employed earlier in the case of 
-B, though the principal modes of A singled out by this construction will, in general, not be the same as 
those corresponding to the B ellipsoid, so we label the perpendicular unit vectors with a prime: M^, Mj, 

and Afg. We shall denote by c = (ci, 02,03) the values on the diagonal when C is diagonalized; hence \cj\ 
is the length of the j'th principal semi-axis of the C ellipsoid. 

The possible values of b an d 6^ are constrained by the fact that, whatever U may be, t he ma trix which 



determines B throu gh (2.19| ), and is determined by B through the inverse relationship ( 2.21 ), satisfies the 



isometry condition (2 



and (2.9) and has non-negative eigenvalues. One consequence is the fact that the 
B ellipsoid must always lie inside the unit Bloch sphere of B. These constraints are found explicity for the 
_ ri :„ c — FTTl u„i — T„.4- — j„ „„i 1 j-i,„:„ c — , :„ „i xhere are, of course, identical 



in Sec. IV below, but we do not know their form in general. 



constraints on the possible values of c and 6 
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III. ERROR RATES AND QUALITY FUNCTION 



Consider the problem of a general channel of the type introduced in Sec. |n[ from a qubit .4 to a Hilbert 
space T (which could be S or C or f ), described by a superoperator Vjr. Suppose that symbols a\ and ai are 
encoded in basis states rh and — m in the Bloch sphere representation of and the corresponding states 
are mapped by Vjf to the two density matrices and p\: in T . Now suppose that every time a signal a\ or 
a-i is sent, a measurement is carried out on T corresponding to a decomposition of the identity 



IT 



^E^'^' (3-1) 



where the {i^fe} are projectors, or, more generally, positive operators ||T^. Further suppose that if the result 
of this measurement is ffe corresponding to -Ffe, an estimate of the original symbol is made according to a 
fixed guessing function^ 5(aj|/fc) which is either or 1, and satisfies 

EfKIA) = l- (3.2) 

3 

That is, if g{ai\j\) — 1, then if /i is measured, one guesses that a2 was sent. If aj is sent with probability Pj, 
one can show that the average error rate (guessing a\ when 02 was sent, or vice versa) using this procedure 
is bounded below, see [|6l, by the (achievable) minimum error rate 



£ . - 1 



1 - Tr 



{Ipip't - P2P%\)] , (3.3) 



where the absolute value |C| of a Hermitian operator C is the operator obta ined by replacing the eigenvalues 



of C with their absolute values in its spectral decomposition. Motivated by (3^), we define the quality factor 
Qjr(m) or "distinguishability" for mode rh as 

Q^(m) = iTr - = Tr(|f7^(m)|) , (3.4) 

where 

r!^(m) - V>(im • cr) = ^rh-V^{(T). (3.5) 

Thus Qjp-(m) is a number between and 1, equal to 1 — 2emin in the case in which pi — P2 — 1/2. A quality 
factor of 1 means the two signals can be perfectly distinguished, while means that measurements are no 
more effective than random guesses. 

The quality factor as a function of mode, Qjr(rn), will be called the quality function. Knowledge of this 
function provides a general measure of "distinguishability" for pairs of non-orthogonal as well as orthogonal 
signals in for if ai and 02 correspond to density matrices 

^ fy'^^\{l + x^-<t) (3.6) 



in the distinguishability of the corresponding signals in compare (3.4), is 

\'^^{\Mpa)^Mpa)\) = \xTr{\x.Ma)\) =\xQAx), (3.7) 
where ;e is a unit vector in the direction of a;^ — a;^, and x — — x'^\. 



The quality factor Qri'rh) for mode rh is, (3^), the sum of the absolute values of the eigenvalues of 



VLjr[rh). When T = H, Vjf = V, and since V is an isometry, V{^rh • cr) has two non-zero eigenvalues, -1-1/2 



^One can employ values for g{aj\fk) between and 1, provided (3.2) is satisfied, for a random guessing process in 
which g{aj\fk) is the probability of guessing aj given fk- However, one can show that this more general procedure 
does not result in a smaller error rate, so we shall not consider it further. 
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and —1/2, the same as those of |m • cr. Hence Q-H(rn) = 1, independent of rh. When = B or £, it is 
helpful to note that fig and ilg can be obtained from partial traces of 



3 3 
9=1 1=0 



see (2.12). In particular, one has 



(3.8) 



(3.9) 



since Bim, (2.19), is the trace of Fim, and Bqo — 0, see (2.20). The eigenvalues of flis{rh) are ±i|m • B\, 
where m • B (see the discussion in Sec. ||) is the vector from the center of the ellipsoid to the point on its 
surface which is the image, under Vg, of rh, and \rh • B\ is its length. The same argument applies when J- 
is C, and thus we have 



QB{ra) = \ra ■ Bl Qc{rh) ^ \rh ■ C\. 
If we write rh in terms of the principal modes of A relative to the B and C ellipsoids. 



■J o 



(3.10) 



(3.11) 



respectively, the two quality functions can be written explicitly as: 

3 3 



j=i i=i 



(3.12) 



where the bj and the Cj are the diagonal elements of the diagonalized matrices B and C, and represent in 
each case the length of the j'th principal semi-axis of the ellipsoid. 
In the case T = E , we have 



rif (m) 



9=1 



(3.13) 



upon tracing (3.8) over B. There is no simple expression (known to us) for the spectrum of r2f:(rn), and 
we cannot represent the quality function Q^im) using a Bloch sphere. Nonetheless, it is worth noting that 
Q^irh) is completely determined if the superoperator Vg, or, equivalently, the matrix i?, is given. That is 



because B determines ( 2.21 ), and as noted below Eq. (2^), it thus determines Fi^, see ( 2.13 ), up to a 
unitary transformation on As a consequence, this transformation cannot change Q^irn) since it leaves 
the spectrum of ^^(m) the same. On the other hand, Qc("^) is not uniquely determined by Vg, because 
there are many ways in which E can be expressed as a tensor product C (^T). Nonetheless, it is clear that 



Qc(m) < Q£(m), 



(3.14) 



because measurements on C alone clea rly ca nnot do better at distinguishing signals than those carried out 
on the full space E. A formal proof of ( ^.14 ) is obtained by noting that 



^c{rh) = Tivinsirh)], 



(3.15) 



and using theorem 3 of App. B. 



IV. OPTIMAL COPYING: CENTERED ELLIPSOID 



We approach the problem of optimal copying in the following way. Suppose a copying machine produces 
one copy in the B output channel characterized by a quality function Qg(rn) which, as noted in Sec. [II, 
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is completely specified by a set of principal modes in A together with the semi-axes of the corresponding 
ellipsoid in the B Bloch sphere, the absolute values of 61, 62, and 63. We then ask, given b = (61, 62, ^3), what 
is the best possible copy which can be produced in a separate one qubit (two-dimensional) output channel 
C; that is, what is the best or largest quality function Qciiri)? 

It is not obvious at the outset that this question has a well-defined answer, for one can certainly imagine 
that with a fixed b, one type of copy machine might produce an optimal quality in C for mode rh, and a 
different machine would be needed to make the best copies for a different mode rh . But we will show that 
for a given b there is, indeed, a single machine which produces the best possible copies in C, as measured 
by Qc, for every rh. That is, given 6, there is a largest ellipsoid in the Bloch sphere of C, characterized by 
semi- axes ci, C2, and C3, with 



c = (ci,C2,C3) = G(6), 



(4.1) 



a well-defined function of b. The construction, incidentally, yields non- negative Cj. 

In the present section, the optimization map G will be derived by considering the special case of an ellipsoid 



centered in the Bloch sphere of B, that is, ^ =0 in (2.27). That considering this special case is actually 



sufficient will be shown in Sec. 0, where we use a concavity property of the quality function to show that 
the quality cannot be further improved by using an ellipsoid which is not centered in the Bloch sphere of 
B. However, given the pair b and c = G{b), it is not at all obvious that the corresponding copy machine is 
optimal, since there might be a machine which produces better copies both in the B channel and in the C 
channel. This possibility is studied below in Sec. VI, where we also work out cert ain p roperties of G. 

If the |e/) in (2.4) are mutually orthogonal, the matrix Eim of inner products, (2.6) is diagonal, and can 
be written in the form 



Elm = PfSlri 



(4.2) 



where the Pi are non-negative numbers. Further, wc can choose an orthonormal basis {|e/)} in £ such that 

\ei)^Pi\ei), (4.3) 



The symbol (3 without a subscri pt w ill be used to denote the four vector {(3o, Pi, P2, Ps)- Because of (|2.15| ) 
if E is diagonal, the matrix B, ( ^.1£ ), is also diagonal, 



Bim — biSim, 

and we will use b without a subscript for the four vector of diagonal elements: 

b={b^MMM) = (i,b). 

Using (^T9|) , (|2T5| ), and (U), we can express the elements of b in terms of the Pj through 

b = K-p\ p^^\A-b, 
where the dot denotes matrix multiplication, and 

P^ = {PlPlPlPl) 



(4.4) 
(4.5) 
(4.6) 
(4.7) 



is a four vector whose components are the squares of those of p. If (4.6) is written out explicitly one ffnds: 



3=0 



b, = Po + Pi - Pi - Pi" for 9 = 1,2, 3. 



(4.8) 
(4.9) 



bmce the off-diagonal 5^ in (|2.20D is zero, ([l.4|), the ellipsoid is centered, and the fact that B is diagonal 
means that the principal axes of the ellipsoid coincide with the Cartesian axes of the Bloch sphere. Note 
that the property last mentioned can always be achieved by choosing an appropriate orthonormal basis in B, 
so that there is no loss of generality in as suming, if th e ellipsoid is centered, that the B matrix is diagonal, 
of the form (14). But in that case ( 2.21 ) — note ( 2.15| ) — tells us that the matrix E is diagonal, so that the 
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\ei) must be mutually orthogonal. Thus by considering all mutually orthogonal sets of |e;) , we take care of 
all cases in which the B ellipsoid is centered. The fact that each component of 0^ in (4.6) is non-negative, 
together with = 1, places a set of four constraints on the components of b: 
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-1; 6„ + 6„, < 1 + 6„// for g = 1,2,3. 



(4.10) 



These are the necessary and sufficient conditions for b to be physically possible when 8^ — Q, and they 
specify that it lies within a tetrahedron with vertices 

(1,-1,-1), (-1,1,-1), (-1,-1,1), (1,1,1). (4.11) 

As noted at the end of Sec. b in ([4.9| ) determines Q£-(rh) but not Qcijn)^ which depends on the tensor 
product structure C oi £. One choice that maximizes Qci'rh) is the set of basis vectors |e;) with 



V2|eo) = |00) + |11), 
V2|e2) =-i|01)+i|10), 



V2|ei) = |01) 
V2|e3) = |00) 



|10), 
111), 



(4.12) 



where orthonormal basis vectors for the two-dimensional space C are den oted by |0) and |1), the same 
notation is used for the basis of P, and in the kets on the right sides in (4.12) the C label is to the left of the 
T) label, thus \cd). The space T) is two-dimens ional , but could be higher -dime nsio nal wit h a dditio nal b asis 
vectors |2), |3), ...etc., which do not appear in ( 4.12| ). Substituting (4.12) into ( |4.3| ), and (4.3) into (2.5), we 
obtain the explicit expressions 

72^^10) = (/3o + /33)|000) + (/3o - /33)|011) + (/3i + /32)|101) + (/3i - /32)|110), 

^/2V\l) = iPo - /33)|100) + (/3o +/?3)|111) + (A - /32)|001) + (A + /32)|010), (4.13) 

where the labels in the kets on the right side are in the order B, C, V, that is to say, \bcd). Note that in Bloch 
sphere notation, ( p.22D , |0) corresponds to r = (0,0, 1), |1) to r = (0,0,-1), (|0) + |l)]/\/2 to r (1,0,0), 
etc. Keeping this correspondence in mind, it is easy to check that the superoperator Vg based upon ( 4.13 ) 
maps r = (1,0^0), (0,1,0), and (0,0,1) to s = (6i,0, 0), (0,62,0), and (0,0,63), respectively, where the bq 
are given by (|4.9| ). 

We now define a four vector 7 — (70, 71, 72, 73) in terms of /3 through: 



7 = iA./3, /5 = iA.7, 
and rewrite (t4.13|) in terms of the as: 

V2V\0) = (70 + 73)1000) + (71 + 72)1011) + (70 - 73)1101) + (71 - 72)1110), 
V2V\1) = (71 + 72)1100) + (70 + 73)1111) + (71 - 72)1001) + (70 - 73)1010). 



(4.14) 



(4.15) 



Note that if the first two labels, the B and C bits, are i nterchanged on the right side of ( 4.13| ) and, at the 
same time, each Pj is replaced by 7j, the result is ( 4.15 ); that is to say, the 7j play precisely the same role 
for the space C as the j3j for the space B. Consequently, we can immediately conclude that the ellipsoid in 
the Bloch sphere of the C channel, produced by the action of Vc, is centered and has principal semi-axes ci, 
C2, and C3 lying parallel to the three Cartesian axes, with 



(C0,C1,C2,C3) = (l,c) = A . 7^ 7^ 



iA-c, 



(4.16) 



the counterpart of ([l.6|), and 7^ the four vector whose j'th component is 7^. Inserting ( 4.14 ) in ( 4.16 ), one 
obtains explicit expressions for the Cq with q > 0: 



2(l3ol3q+Pq.Pqn 



(4.17) 



where q' and q" follow the notation of ( ^.10| ). In addition, since the principal axes of the ellipso ids in the 
B and C Bloch spheres correspond to the same principal modes in A, we can omit the primes in (3.11) and 
(3.12). Thus, in particular, the quality function for C is given by 
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(4.18) 



The next step is to show that (4. IS) is, indeed, the optimal quahty function, given 61, b2, and 63. To 
demonstrate that this is the case, we shall show that (3.14) is satisfied as an equality: Qc(^) ~ QsiiT^)- In 



order to evaluate Qeirh), one needs to find the eigenvalues of the Hermitian operator D,£{rh) — V£{^rh • cr) 
whose matrix 



(ez|17£(m)|e„) = ^ HjL{lm; jO)Pi(3„ 

3 = i 



(4.19) 



when written out explicitly using (3.13) and (2.13), takes the form: 



/ fJ-lPoPl M2/?0/?2 M3^0^3 

M2/3o/32 -«Ai3/^i/32 iMi^2^3 
Vm3/3o/33 iM2^1^3 -«Ail^2/33 



(4.20) 



In ( p^ and ( ^^ we have replaced each component ruq of ifi by the corresponding /iq, since — see ( |3.1l[) 
the principal modes Mj are along t he co ordinate axes. 
The characteristic polynomial of ( 4.20| ) is 



det(f^£ - XI) = - TA^ + A^, 



where — once again employing the notation in (2.10) 

3 



r = E /^'(/5o/3' + /?''/3^), A = /3o/3i/32/33(E ^')- 

q=l 9=1 



(4.21) 



(4.22) 



Evidently, the eigenvalues of Qs occur in pairs ±Ai, ±A2, where we assume that Ai and A2 are positive. 
Thus TrdOfI) is 2(Ai + A2), and one finds, after a bit of algebra, that 



Q£(m) = Tr(\ne{rh)\) = VT + 2\A\ = (c,7^,)'} 



where 



2{\PoP,\ + \Pg'Pg"\)- 



(4.23) 



(4.24) 



Absolute value signs have been inserted because even though wc have assumed tha t (3j a re non-negative in 
(4.2), we could use negative values in (4.19) or ( 4.2C| ), and it is clear from (4.21) and (4.22) that the spectrum 
of Qs depends only on t he squ ares of the /?, . O n the other hand, as long as the f3j are non-negative, one 
sees, by comparing ( 4.17 ) with ( 4.24 ), and ( 4.18 ) with ( 4.23 ), that QciiTi) and ©^(m) are, indeed, identical. 
Hence, if the ellipsoid is centered in the B Bloch sp here , and the values 61, 62, and 63 are specified, the 
construction associated with ( 4.12 ) or, equivalently, ( 4.13| ), with the (3j assumed to be non-negative, yields 
copies in th e C channel of the highest possible quality, whatever mode is chosen; see the discussion associated 

with (|t|). 



V. NON-CENTERED CASE: SYMMETRY AND CONCAVITY 



A. Centered Case is Optimal 

The problem of optimal copying has been solved in Sec. |l^ in the sense that if the ellipsoid is centered 
in the B channel and has principal semi-axes b = (61,62,63), then the optimal quality for copies in the C 
channel can be achieved using a centered ellipsoid with principal semi-axes c — (01,02,03) — G{b). Could 
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the copies in C be be further improved by employing a copy machine for which the B elhpsoid is not centered 
in the Bloch sphere? That it cannot follows from the argument given below, which is base d upon s ome 
properties of of Qsirh) which are themselves not without interest, and are derived in Sec. VB and Sec. VC 
below. 



As noted at the end of Sec. [II, Qf (rh) for a fixed mode rh is completely determined by the 4x4 matrix 
B of ( 2.201 ). Since for the following argument we can assume that rh is held fixed, it will be convenient to 
denote the dependence of Q£{rh) on B by Q£[i3] or, when B has the diagonal form ( 2.27 ), by Q£[b,5^], 



suppressing the reference to rh. Here b is the set of ellipsoid semi-axes and 8 is the displacement of the 
ellipsoid from the center of the Bloch sphere. 

The first property of [B] which is of interest is the symmetry 



(5.1) 



proved in Sec. |VE| . The second, proved in Sec. VC, is that ©^[-B] is a concave function in th e se nse that, 
whenever B^ and B^ are acceptable B matrices, that is, generated by isometrics of the form (2.4), and pi 
and p2 are positive numbers summing to 1, piB^ + p2B^ is also an acceptable matrix, and 



(5.2) 



Given the symmetry and concavity properties just mentioned, the fact that an "off center" copying machine 
cannot improve upon the "centered" version of Sec. IV can be shown in the follow way. Let the copy machine 
produce an ellipsoid [b, S^] in the B channel, and consider a particular mod e rh for which this machine 
produces a copy quality Qciin) in the C channel; of course, Qc("^) cannot, ( 3.14 ), exceed Qsirh), which 
we denote by Qe[b, (5^]. Then, with an obvious choice for B^ and B^, and with pi = P2 = 1/2, ( ^.2[ ) tells us 
that 



Q£[b,0] > {Qs[b,S''] + Q£[b,-5^])/2 = Q£[b,5' 



(5.3) 



where we have made use of the symmetry ( ^.l[ ). In Sec. |^ we showed that for a centered ellipsoid [b, 0] it 
is possible to construct an optimal copying machine which in the C channel produces copies of quality 



Qr(rn) = Q£[6,0], 



(5.4) 



and, consequently, the C quality for the "off center" machine is bounded by 

Qc(m) < QTi^)- 



(5.5) 



Since this is true for any mode rh, no improvement over the "centered" machine of Sec. IV is possible 



B. Proof of the Symmetry Property 



The result (5.1) can be established as follows. Let V in (2.4) be the isometry which gives rise to an 



ellipsoid [b,S ] , and define a second isometry 

3 

y'|a)=t/(|a)®|e))=^(ana))®|el) 



(5.6) 



(=0 



where the |ej) are obtained from the |e;) by an anti-linear operation ("time reversal") as explained below. 

Choose a reference orthonormal basis {l^j}} for £, and for any |e) in £, define its "complex conjugate" 
|e*) as the vector with components 



{r,\e*)^{r,\er. 

Likewise, if F is any operator on £, the operator F* is defined by the matrix elements 

{r,\F*\r,) = {r,\F\r,r, 



(5.7) 
(5.8) 



11 



that is, the matrix of F* in this basis is the complex conjugate of that of F. Of course, both |e*) and 
F* depend upon which reference basis is used, but that is immaterial as long as the latter is held fixed 
throughout the following argument. For later reference, note the relationships 



(elD = (e|/)*, Tr[F*] = (Tr[F])*. 
Now define the |ej) introduced in ( p^ ) by 

|eo) = |eo>; 10 for g = 1,2, 3. 



It is easy to check that (2.8) and (2.9) are satisfied by the matrix elements 



(5.9) 
(5.10) 
(5.11) 



if they are satisfied by the Eim, so V with the jej) defined in ( 5.1C| ) is, indeed, an isometry. The corresponding 
superoperator V' , see ( ^.11 ) to ( 2.15| ), can be written as 



(5.12) 



with 



(5.13) 



It is then a straightforward exercise to show, using the properties of L{jk; Im) given in App. A, that 

(5.14) 



F' — -\- F* 



where the sign is — for ^ = 0, m > and for I > 0, m = 0; and is + in all other cases. Consequently, the 
elements of the B' matrix corresponding to V' , sec ( 2.19 ), are given by 



(5.15) 



with the sign following t he sa me rule as in ( 5.14 ); note that the complex conjugate can be ignored when we 
use the trace formula in (|5.9|), because the i3;,„ are real. Therefore B' is the same as B, except that 



B' 



Qq 



Bog for q = 1,2,3; 



(5.16) 



we need not concern ourselves with the BqQ, because they vanish. That is, B' is obtained from B by reversing 
the sign of , precisely wh at w e need in order to investigate (5.1). 



The final step in proving (5T) is to use ( 3.15 ) for both V' and V to show that 

3 3 
g=l q=l 



(5.17) 



Since ri£(m) is a Hermitian operator, its eigenvalues are the same as those of r2J(m), and since the quality 
factor is the sum of the absolute values of the eigenvalues, we conclude that 



Q^(m) = Tr(|0^(m)|) = Tr(|f^£(Tn)|) = Qe{ni), 



(5.18) 



which is (5.1). 
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C. Proof of the Concavity Property 



To establish (5.2) we proceed as follows. Let and V"^ be any two isometries of the type (2.4): 



(5.19) 



(=0 



Since there is no constraint upon the dimension of the space £, we can assume that the vectors |e[) are 
chosen in such a way that 



(e/|el)=0 



(5.20) 



for all / and m; in other words, the subspace off containing the vectors {|e;^)} is orthogonal to that containing 
the vectors {|e^)}. This, of course, places no restriction upon the matrices 



(5.21) 



aside from the isometry conditions (2^) and (2^), which must be satisfied separately for r — 1 and 2. 
Now construct a third isometry 

V ^ + (5.22) 

where pi and p2 are non-negative numbers whose sum is 1. That this V is an isometry satisfying (pj) is a 
consequence of the fact that both V'^ and are isometries, alo ng wi th ( 5.2C ), which implies that (V'^)^y^ 
and {V'^yV^ both vanish. The superoperator corresponding to (5.22) takes the form 



V{k) = VkV^ = PiV\k) +P2V^{h) + ^/pIp^W{k), 
where is the superoperator corresponding to , and 

W{k) = V^K{Vy + V^K{Vy 

has the property that 

Tr£[VK(At)] =0. 



(5.23) 
(5.24) 
(5.25) 



To see that (5.25) is the case, note that ( 5.20| ) implies that we can choose an orthonormal basis {lep} for £, 
with r = 1 or 2, and k taking on as many values as necessary, such that 



{el\ei)^0^{ef\ei) 



(5.26) 



whatever the values of I and k. Ev aluating the trace (5.25) in this basis shows that it is zero. 
Hence the partial trace of ( 5.23| ) over £ yields 



V8(^) =PiV^{n)+p2V^{K), (5.27) 

and, as a consequence of (2.18), the corresponding Ax A B matrices of Sec. || are related by 

B=piB^+P2B^. (5.28) 

Since B^ and B^ can be any two B matrices which are physically p ossib le — the isometries and V"^ are 
arbitrary — and pi and p2 any positive numbers whose sum is one, (5.28) shows that the set of physically 
possible B matrices generated by isometries of the type (2.4) is convex. 
Tracing (5.23) over B yields 



V£{k)^Pi%\k) +p2V^{K)+^^W£iK), 

and W£{k) does not (in general) vanish. If wc write the identity on £ in the form 



(5.29) 



(5.30) 
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where 

^'^=El4)(sn (5-31) 

3 

is a projector onto the subspace containing the {lej")}, then 

P^Ve{K)P^ ^P^V^{k), P^V£iK)P^ ^p2ViiK), (5.32) 
and theorem 2 in App. B tell us that 

TT{\Vei^)\)>p,TT{\V,\>,)\)+p2TT{\V^{n)\). (5.33) 



Substituting k — ^rh • cr into this expression yields (5.2) 



VI. OPTIMAL PAIRS AND OPTIMIZATION MAP 
A. General Conditions 

In Sec. 1^ we showed that if b is the three vector of principal axes for the ellipsoid of the B channel, 
there i s a largest ellipsoid for the C channel, characterized by principal axes c and given by the function 
G(6), ( [4.l| ), which represents the best possible copy in C if 6 is held fixed. Note that the components of 
c = G{b) are all positive, whereas those of b may be either positive or negative. In this section we examine 
the optimization function G{b) in a bit more detail, and deduce necessary and sufficient conditions for an 
optimal pair (6, c) of copies. 

If / = (/i, /2, /a) is a three vector with real components, we define its absolute value to be 

/+ = (l/lU/2M/3|). (6.1) 

We shall write f > provided each fp, 1 < p < 3, is non-negative, and / > if, in addition, at least one fp 
is strictly positive. The relationship f > g means f — g > 0, or fp > gp for 1 < p < 3, whereas f > g means 
f — g > 0. The same notation will be used for a four vector such as /3 = (/3o, Pi,P2, Ps)', is its absolute 
value in the sense of (|6.l| ) , and /3 > means that at least one component is strictly positive and the rest are 
non-negative. 



A three vector b will be said to be possible provided it satisfies (|4.10| ) and is thus a possible collection 
of semi-axes (some of which may be negative) for the ellipsoid in B, while a possible pair (6, c) is one that 
corresponds to some conceivable copying machine, i.e., is allowed by the laws of quantum mechanics. The 
pair (6, c) will be said to be better (in quality) than (b, c) provided 



b <b^, c+ < c+, (6.2) 
and at least one of these inequalities is strict, e.g., c+ < c^. If both inequalities are equalities, the pairs are 



equiv alent (in quality). We use absolute values in (6.2) because they alone enter the quality function, see 



(3.12). Finally, an optimal pair (b, c) is one for which no other possible pair can be found which is better. 
Note that if (b, c) is optimal, so also is (c, 6), for it obviously does not matter which output channel is labeled 
B and which is labeled C. 

For any possible 6, the argument in Sec. IV, supplemented by the results in Sec. 0, tells us that (b, G(b)) 
is a possible pair, and for any other possible pair of the form (b, c) it is the case that 

c+ < G'(b); (6.3) 

recall that, by construction, G(b) > 0, so it is not necessary to take the absolute value on the right side. This 
does not mean that given any possible b, (b, G(b)) is an optimal pair in the sense defined above, because 
there might very well be a positive optimal pair (b', c') with b'^ < b' and G(b) < c'. Indeed, this can occur 
even in cases in which b = b^ has all positive components. However, if (b, c) is an optimal pair, then so is 
(b,G(b)), and 

c+ = G(b). (6.4) 
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The reason is that th e pa ir (b,G{b)) is possible, and were it better than (b, c), that is, were ( p!^ ) the 
inequahty allowed by (S.3), then (b, c) would not be an optimal pair. Thus we see that if (b, c) is optimal, 
so is (b, c+), and, by the same sort of argument, (b^, c) and (b"^, c+) are also optimal pairs. 

An optimal pair (b, c) with b > and c > will be called a positive optimal pair. Note that if we reverse 
the signs of some of the components of b or c, or both, the resulting pair will also be optimal provided it is 
possible. Thus the task of characterizing optimal pairs can be divided into two parts: finding necessary and 
sufficient conditions for positive optimal pairs, and conditions under which reversing some of the signs in a 
positive optimal pair results in a possible, and therefore optimal pair. 

The optimization map G(b) described in Sec. 



IV is obtained through a series of steps which can be 



represented schematically in the form: 



(6.5) 



in terms of the four-vectors introduced in Sec. IV; see, e.g., (4.5) and ( 4.16| ). Here < — > stands for the 



reversible transformation obtained by multiplying a four vector by the matrix A and perhaps a positive 
constant: see the explicit expressions in (^!q), (4.14), and (4.16). The single arrow means taking the square 



of each component; thus is (/?q, (i^Tpz)- This operation is not reversible because it is many to one or, 
equivalently, because the square root ca n be eithe r po sitive or negative. (Note that > is a consequence 
of the positivity of the matrix i?;™, see ( ^.6[ ) and (4.2), so wc do not have to worry about imaginary roots.) 
Thus in going from the left side to the right side of (6.5) we have to specify the square root of The map 
G(b) defined in Sec. IV employs the positive root, /3 > 0. 

As a consequence, whereas one or more of the components of b may be ne gativ e (recall that 5o = 1 = co), 
the components of c, which can be expressed in terms of those of /? using (4.17), are always non-negative: 
G(b) > 0. On the o ther hand, /3 > does not imply that the components of 7 in ( |6.5| ) are positive; indeed, 
the counterpart of (4.17), 



hq = 2(707q -|-7g'79") 



(6.6) 



for 1 < g < 3, shows that if some hq is negative, some of the 7^ must be negative as well. 

If in (3.6) we replace every component of 7 by its absolute value, the result will be a vector b' with, 
obviously. 



b+ < b' = G(c), 



(6.7) 



since G(c) results from following the chain in (6.5) from right to left with 7 replaced by the positive square 
root of 7^ 



By examining (6.6) for each p in turn, and noting that 70 must be strictly positive (half the sum of the 
/9j), it is easy to show that a necessary and sufficient condition for equality between b^ and b' in ( ^.7| ) is 
that 



74 := 7o7i7273 > 0. 



222 
7o7i72 



222 

-7o7i73 



222 
7o72 73 



,222, /2| 2, 2, 2\ 

+7i72 73 + 74(70 + 7i + 72 + 73): 



a consequence of (3.6), tells us that if (b,c) is an optimal pair, then 



6162&3 > 0. 



(6.8) 



Consequently, if 74 is negative, (b, c) with c = G(b) cannot be an optimal pair, since (b', c) with b' ~ G{c) 
is possible, and better. In addition. 



(6.9) 



(6.10) 



The reason is that if (6.1C) is violated, so is (6.8), which would mean that (6.7) is a strict inequality, and 
( G(c) , c) is better than (b, c). But this is not possible if (b, c) is optimal. Note that (6.8) is stronger than 
( |6.10 ), so even if ( |6.10 ) is satisfied, (b, G(b)) need not be optimal. Of course, ( 6.1C ) with b replaced by c 
must also ho ld for any optimal pair (b, c). 

However, ( ^.ICj ) is enough to settle the question of whether, given a positive optimal pair (b, c), a pair 
(b', c') obtained by reversing the signs of one or more components of b or c (or both) is possible, and hence 
optimal. The answer is that b' is either b itself, or obtained from b by reversing the signs of precisely two 
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components of b, leaving the sign of the other component fixed. If b has a component equal to zero, this 
can, but need not be counted among the two whose sign is reversed, and thus any or all of the non-zero 
components can be changed in sign without violating ( |6.10| ) or the two component rule just mentioned. The 
same relationship must hold between c' and c. Now since reversing the sign of two of the components of b 
can be achieved using a suitable unitary transformation on B, see the discussion in the paragraph preceding 
(2.28), and a similar comment applies to c, a (b',c') obtained in this way is surely possible, and therefore 
optimal. On the other ha nd, r eversing the sign of just one component of b, assuming all three components 



are non-zero, will violate ( 6.1C ), and thus the resulting pair is not possible, since if it were possible it would 
be optimal. And the same comment applies to c when its three components are all positive. 

In summary, the task of identifying all optimal pairs (b, c) reduces to that of identifying all positive 
optimal pairs. All others are obtained from these by changing the signs of two components of b and/or 
two components of c, since this operation always yields an optimal pair, and any optimal pair which is 
non-positive is the result of such an operation. In addition, if (b, c) is a positive optimal pair, it follows from 
the definition of optimality and the properties of the map G, see (|6.4[), that 



c = G(b), b = G(c), 
that is, the chain ( |6.5| ) will work in either direction, and, therefore, 

P>0, 7=iA-/3>0. 



(6.11) 



(6.12) 



However (3.11), or t he eq uivalent (6.12), while it implies (6.8) and the corresponding condition for /3, and 
is thus stronger than (6.10) or the corresponding condition on c, is a necessary, but not sufficient condition 
for (b, c) to be a positive optimal pair, because (as noted following (6.3)) there might be a pair {b',c') for 
which c' = G{b') and b' = G{c'), with b < b' and c < c'. To examine this possibility, it is useful to study 
the Jacobian matrix of the map G. As shown in App. one can relate the differentials of b and c through 



dc = (16/34)"V- db 



(6.13) 



or its inverse 



db^ {16j4,)-^K-dc, 



(6.14) 



where 74 is defined in (6^), and (3^ is the corresponding quantity for /3. The 3x3 matrix J has components 

Jqq — hq'hq"^ ^qq' — hq'Cq"^ ^q' q hqCq" ^ (6.15) 

while those of K are 



qq 



hq'hqii 



K, 



qq 



-hq'bq'i 



q q 



'hqhqn 



and h = (hi, /i2, h^) is a vector whose components are defined by 



hq = 2i/3of3q - Pq^Pqn) = 2(7o7g " 7?' 7?" ) • 



(6.16) 



(6.17) 



Let us assu me that 74 > 0, as will be the case except on the boundaries of the region ( 6.12| ); note that this 
implies, ( |6.6[) , that each component of b is strictly positive. If we can find a dc whose components are all 
positive and which, inserted in ( |6.13 ), yields a db whose components are also positive, then it is clear that 
(b, c) cannot be a positive optimal pair, for there is a better (b', c') in its immediate vicinity. It turns out, 
see App. that under the conditions (6.12), at most one of the hq can be negative. If we suppose that 



hi < 0, h2> 0, /13 > 0, 



(6.18) 



the elements Kgi o f the first column of the K matrix are all strictly positive, and therefore if dc = (1, 0, 0) 
is inserted in (6.14), the resulting db will have all three components strictly positive. Clearly the same will 
be true for a dc wit h all positive components which is sufficiently close to the one we have just considered, 
and, consequently, ( |6.18 ), or any other case in which two components of h are positive and one is negative, 
is inconsistent with (b, c) being a positive optimal pai r, as long as 74 > 0. 

What this suggests is that in addition to ( |6.11 ) or (6.12), we need the condition 
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h>0, 



(6.19) 



in order to ensure that ( b, c) is a positive optimal pair. Note that we have not established with complete 
mathematical rigor that ( |6.19|) is a necessary condition for a positive optimal p air, si nce the argument given 
above no longer works (at least in any simple form) if, for example, /12 > in (6. IS) is replaced by /i2 = 0. 
Such "boundary cases" require additional study, which we have carried through in some simple cases, but not 
in co mplet e detail. In addition, even if (6.19) is necessary, it does not follow that it is sufficient, in conjunction 



with (6.11), to guarantee that (6, c) is a positive optimal pair, because it refers only to local properties of the 
G map. While the region defined by ( |6.19 ) along with (6.12) is connected, we know of no obvious reason why 
a given (6, c) in this region must be connected to a better (b, c), assuming one exists, through some path 
along which each component changes monotonically. Such a path can be ruled out by a minor extension of 
the above analysis as long as everywhere on it all the components of h, together with 74 are strictly positive. 
But paths along which some components of b or c increase while others decrease, followed later by increases 
and decreases of other components, the final result being an increase in all components, are easy to imagine. 

Having not succeeded in supplying an analytical argument, we attacked the pr oblem nu merica lly in the 
following way. Vectors b were chosen at random in the "good" region defined by ( |6.12 ) and ( 6.19 ), and, for 
each such b, further vectors b lying in the "good" region and also satisfying b > b were chosen at random, 
and we checked to see whether 



G(6) > Gib). 



(6.20) 



With some 4000 choices of b, and for each of these 100,000 choices of b, we found no cases in which (|6.20| ) 
was satisfied, whereas if we made a similar test using b vect ors ly ing outside the "good" region, a random 
search turned up many examples of vectors b > b satisfying ( |6.20| ). 

Consequently, we believe that the necessary and sufficient condition for (6, c) to be a positive optimal pair 
is that b and c are both generated using ( |6.5| ) from a four vector /3 satisfying the conditions 

(6.21) 

-along with the normalization 



for q — 1,2 and 3 — note that the second inequality is equivalent to ( 6.19 ) 



condition (4.S 



Let us define the class P to be the collection of all four-vectors which satisfy the two conditions in (6.21) 
for all q between 1 and 3, whatever the value of Also, we shall say that any two vectors ^ and 77 

belonging to class P are in (or have) the same order provided, for 1 < p < 3 and 1 < 9 < 3, 



sign(Cp - ^9) = sign(77p - 77,), 



(6.22) 



where sign(x) is +1, or —1 for x positive, zero, or negative. For example, if ^1 < ^2 < ^3, then 771 < '72 < 773, 
and vice versa; ^1 = ^2 if and only if 771 — 772. These definitions lead to the following useful result: 

Theorem. If ^ is a four vector in class P, and 77 another four vector obtained from ^ by one of the three 
operations (a), (b), and (c) given below, or by the application of any combination of these operations carried 
out in succession, then 77 is also in class P, and 77 has the same order as ^. The three operations are: 

(a) rj = — (k^o, k£,i, fc^2, ^Cs) for a constant A; > 0. 

(b) 77 = = (Co>Ci>Cf,?f) for an exponent a > 0. 

(c) 77 = A • e 

The proof is straightforward, except for (c), for which see App. y. 

The theorem is useful when applied to (|6.5[), because each step on the way from 6 to c is one of the 



operations (a), (b) or (c), or a combination of (a) and (c). Consequently, if f3 satisfies the conditions (6.21) 
they are also satisfied by 7, and also by b and c. In particular, if /3 is normalized. 



then for 1 < g < 3, 



j=0 



< 6„ < 5o = 1, 



bq > bq'bq" 



(6.23) 



(6.24) 



Note that (6.24) is not only implied by (6.21) together with the normalizati on (6^23| ), it also implies ( |6.21 
and ( 6.23 ), and also the set of conditions for c — (1, c) which correspond to ( 3.24 ). 
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Indeed, ( 6.24 ) is a necessary and sufficient condition for a triple of num bers b = (b i, &2, ^3) to belong to 
a pos itive optimal pair, assuming that the latter can be characterized by ( |6.12| ) and ( 6.19| ). The reason is 
that (6.24) tells us that b — (1,6) belongs to class P, and therefore the four vector /^"belongs to class P 
and has non-negative components. This, together with the normalization ( |6.23| ) corresponding to &o = 1 in 
( 3.24 ) means that b is p hysic ally po ssible . Hence the pair (b, G{b)) is also possible, it is obviously positive, 
and it is optimal, since (6.21), thus (6.19), is sat isfied. 

In addition, the theorem tells us that if ( |6.24 ) holds and the semi-axes are in the order 



h <b2 < bs, 

then the f3q, 7^ are in the same order, and likewise 

Cl < C2 < C3, 



(6.25) 



(6.26) 



for c = G{b), the other member of the optimal pair. In addition, if two of the B semi-axes are equal to each 
other, say bi = 62, then the corresponding C semi-axes are also equal, ci — C2. Note, however, that there is 
no reason to expect 61 to be equal to ci; one ellipsoid can be small and the other large. 

In summary, a necessary and sufficient condition that (6, c) be a positive optimal pair is that one member 
of the pair satisfy (6.24) and the other member be obtaine d by applying the optimization map G to the first. 
The second member will then also satisfy the conditions (6.24), and b and c will have the same order. All 
optimal pairs which are not positive optimal pairs are obtained from some positive optimal pair by reversing 
the signs of two components of the first member of the pair, and/or two components of the second member. 



B. Particular Examples 



Isotropic copies in which 

&i = 62 = &3 = r, Cl = C2 = C3 = s, (6.27) 



that is, the Bloch ellipsoids for B and C are spheres of radii r and s, are obtained by letting 



/3i = /32 = /33 = •^(l-/3o')/3. 



Using (4.9) and (4.17), one can show that 



l-r+ v/(l-?-)(l + 3r) 



(6.28) 



(6.29) 



and, by symmetry, r as a function of s is given by precisely the same function al for m. The relationship 
between the two is shown in Fig. |l|. Since r is between and 1, the condition (|6.24 ) for optimal pairs is 
satisfied. In other words, any point that lies outside the region enclosed by this curve and the two axes is 
prohibited; this is identical to the no-cloning bound in fl^. 




FIG. 1. The relationship between the radii (s and r) of the two output Bloch spheres in the case of isotropic 
copying. Identical copies occur when r = s — 2/3. 
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As one might expect, there is a certain "complementarity" between the quahty of the copies emerging in 
the B and C channels: as r increases, s decreases. Additional insight into the source of this complementarity 
comes from studying anisotropic situations. Consider, in particular, the extremely anisotropic case in which 

/3i = /32 = 0, < /33 < 1/V2; (6.30) 



the upper bound on ensures that (6.21) is satisfied. The corresponding b and c for optimal copies are 
given by 



&i = 62 = 1 - 2/?|, 63 = 1, 
Cl = C2 = 0, C3 = 2/33a/i-/?; 



(6.31) 
(6.32) 



Note that as /S^ increases, the quality factor C3 for the third principal mode of the C copy increases whereas 
the factor for the same mode of B remains perfect, &3 = 1. On the other hand, the quality factors for modes 
1 and 2 of y8 decrease as (3^ increases. Thus improving the quality of a particular mode in one channel leads 
to a decrease in optimal quality of all the "perpendicular" modes of the other channel. In particular, ci and 
C2 are (worthless copies) in this case because of the perfect quality for mode 3 of B. 



VII. QUANTUM CIRCUIT 



A complicated unitary transformation can often be decomposed into several simpler transformations and 
expressed in a pictorial fashion by using a quantum circuit. This is helpful both for understanding the trans- 
formation and for implementing it in a systematic way. It is worth noting that any unitary transformation 
can be produced by a circuit which only uses two-qubit XOR gates along with an appropriate collection of 
one-qubit gates pTf . 

From Sec. the optimal copying procedure corresponding to c — G{b) can always be achieved, for any 
given b, by using a transformation which generates the isometry (|4.13| ) . One example is given by the circuit 
in Fig. 0(a), where the input state is a tensor product \a) |e), with \a) arbitrary, and 



/3o|00)+/3i|01)+/32|ll)+/33|10). 



(7.1) 



Since |e) involves only real coefficients, it can be produced by a preparation circuit as shown in pO| ]. The 
three horizontal lines represent the three qubits going from left to right, as time increases, throug h fou r 
quantum gates. The order of the qubits from top to bottom is the same as the left-to-right order in (4.13). 



|a> 



|e> 



(a) 



-e- 



b 

c 



|a> 
|e> 



First part 



Second part 



<H>- 



(b) 



b 

c 
d 



FIG. 2. (a) Optimal copy circuit. The qubit to be copied enters the circuit as |a) on the left, and the copies emerge 
in b and c on the right, (b) Alternative circuit, see discussion in text. 

The first gate in Fig. ||(a) produces the Hadamard transform 

|0)-| + > = (|0> + |1))/V2, 

|1)-|-) = (|0)-|1))/V2, (7.2) 

on a single qubit. The remaining three gates are XOR gates acting on different pairs of qubits; each 
corresponds to the transformation 
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|00) ^ |00), |10) ^ 111), 

|01) ^ \01), 111) ^ |10), (7.3) 

in which the left qubit in this formula is the "controlling" and the right qubit the "controlled" qubit. If the 
controlling qubit is |0), the controlled qubit is left unchanged; if the controlling qubit is |1), the controlled 
qubit is flipped from |0) to |1), or |1) to |0), which is known as "amplitude-flipping". In Fig. ^(a) the 
controlling qubit is indicated by a solid dot, and the corresponding controlled qubit by a plus inside a circle. 
Thus the controlling qubit is the top, bottom, and middle qubit in the first, second, and third XOR gates. 

The transformation produced by the circuit in Fig. ^(a) can be more easily understood if one uses the 
alternative circuit in Fig. ^b), which produces exactly the same overall unitary transformation. It employs 
another gate, indicated by tt inside a circle connecting two black dots, which produces the transformation 

|00) <-* |00), |10> |10), 

|01)^|01), 111) ^ -111), (7.4) 
where the phase of |11) is multiplied by a factor of e'^ = — 1. It flips one qubit between 1+) and | — ), see 



(7.2), if the other qubit is |1). This is known as a "phase flip", in contrast to the "amplitude flip" produced 
by an XOR gate. Note that in this case either qubit can be regarded as controlling the other. 
Both circuits in Fig. ^ carry out the transformation 

|a)(8|e)K^^(a,|a))(^/3,|e,) (7.5) 



corresponding to ( pA\) combined with (4.3), and thus the isometry (4.13) on the input qubit \a). But 
Fig. g(b) is a bit easier to interpret. In the first part of the circuit, the lower two qubits control the top 
qubit: the middle qubit, if it is |1), flips the phase, and the bottom qubit, if |1), flips the amplitude. Thus 
this part of the circuit results in 

|a)»|00)H->(cro|a))®|00), 
|a)0|Ol)^(cri|a))®|Ol), 

\a) (g) 111) (o-iCTala)) ® |11) = (-j(T2|a)) » |11), 

\a) (g)\10) ^ {cr3\a))(g)\10), (7.6) 

where (Tq is the identity, cti flips the amplitude, 173 flips the phase, and aia^ flips the phase and then the 
amplitude. Consequently, a linear superposition of the maps in (|7.6|) results in 



|a)®|e)K^^(a,|a))®A|Q), (7.7) 
I 

where 

|6o) = |00), |ei) = |01), |e2) = -i|ll), \h} = |10). (7.8) 



The second part of the circuit in Fig. |2|(b) is a unitary transform ation on the lower two qubits which maps 
each \ei) into |e;), so that the effect of the two parts combined is ([t^). 

This perspective also provides an intuitive meaning for the Pi's. The action of the superoperator Vg, 



( 2.16 ), on the input state 

a = |a)(a|, (7.9) 

given by 



I 



is determined by the first part of the circuit in Fig. ||(b), thus by (7/7). One can think of (|7.10| ) as a 



probabilistic mixture of states obtained by applying the unitary transformation ai with probability f3f to 
the input state \a). In other words, f3f for Z ^ is the probability of producing a noise of type I (amplitude 
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or phase flipping, or the combination) in the AB channel. This deflnes a PauH ch anne l, in t he no tation 
of and since the AC channel is of the same type — by the symmetry between (4.12) and ( 4.15| ) — the 
corresponding transformation constitutes a Pauli cloning machine as defined in It is worth mention 

that even though this Pauli machine is optimal, there exist optimal machines that are not Pauli machines 

Finally, note that if |e) is of the tensor product form 

- D^y\0) + ^/D^\1)) ® (Vl - Duv\0) + /D^|1)) , (7.11) 

this circuit carries out the optimal eavesdropping i n ||Tl[| for the case of the BB84 cryptographic system ||l^ . 
Here D,j.y and D^y are the error rates defined in |ll|. With Pi = 02 = Ps = l/\/T2 in (|7.1|), this circuit 
carries out Buzek and Hillery's universal cloning. It is an alternative to the circuit given in [|10|. 



VIII. CONCLUSION 



We have shown how to produce two optimal copies of one qubit in the cases in which the quality of 
the copies can be different, and can depend upon which input mode (orthogonal basis) is employed. The 
measure of quality of the copies is a distinguishability measure on quantum states, and differs from the 
fidelity measures used in previous published work. In particular, it uses a geometrical representation in 
which the output possibilities correspond to ellipsoids in the respective Bloch spheres, with a larger ellipsoid 
corresponding to higher quality. The copy qualities are "complementary" in the sense that improving the 
quality of one copy for a particular input mode tends to degrade the quality of the other copy for a different 
set of input modes. Optimal copies can be achieved by means of a relatively simple quantum circuit. 

This represents significant progress towards quantifying the no-cloning theorem of quantum information 
theory. It is useful in applications to quantum cryptography, in which the "copies" go to the eavesdropper 
and to the legitimate receiver, and also to quantum decoherence processes, in which one "copy" disappears 
into the environment. In both of these applications, the copies need not be identical, and their qualities can 
depend upon the input signal. 

There are several respects in which one might hope to extend the results presented here. Producing 
more than two copies of one qubit with (in general) different qualities for the different copies, and qualities 
which depend upon the input mode remains to be studied. Extending our results to higher-dimensional 
Hilbert spaces is a significant challenge, because there seems to be no convenient geometrical representations 
analogous to a Bloch sphere, nor an analytic form of distinguishability measure for more than two states. 
Nonetheless, recent progress by Zanardi and Cerf ||l^ in characterizing anisotropic higher-dimensional 
copies is encouraging. In addition, one could study optimization using other measures of quality or informa- 
tion pH , including Shannon's mutual information. Of course it would be extremely interesting to find some 
general point of view which unifies all the results on quantum copying obtained up to the present. 
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APPENDIX A: PROPERTIES OF L{jk; Im) 

The complex conjugate of 

L(jfc; Im) = lTr[(jjai(Jk(J,n] (Al) 
is the trace of the cr's in reverse order. Invariance of the trace under cyclic permutation then shows that 

L{kj]lm) = L{jk]ml) = L{lm;jk) = L*{jk;lm). (A2) 



21 



That is, interchanging j and fc, or interchanging I and m, or interchanging jk with Im turns L{jk; Im) into 
its complex conjugate. Suppose that two 4x4 matrices P and Q are related by 



(A3) 



Then ( [A^ ) implies that if Q is real, P is Hermitian, and if Q is Hermitian, P is real. 

Explicit values for L{jk] Im) can be obtained in the following way. Suppose that C is the number of the 
four indices j, fc, Z, and m that are zero. It is obvious that if ^ = 4, i = 1, and if C = 3, L = 0. When C = 2, 
L will vanish unless the non-zero indices are equal, in which case 



L{m-qq)=L{Qq-Qq) = l. 
If C = 1, i will vanish unless all the indices are unequal, and by explicit calculation, 

L{Oq-q'q")^~i, 



(A4) 



(A5) 



using the convention ( p.lC| ). If C = 0, two of the indices must be equal, and L vanishes unless the other two 
are also equal to each other. So L is zero except for 



L{qq; qq) = L{qq' ; qq') = 1, L{qq; q'q') = -1, 



(A6) 



with q ' ^ q. The other non-zero elements of L can be obtained from (A4)-(A6) by permuting the indices, 
using @. 

Regarded as a map, (|A3|), L carries certain subspaces of the 16-dimensional space of 4 x 4 matrices into 
themselves. Thus if Q^k — except when jk is equal to 00, 11, 22, and 33, then Pjk has the same character, 
and one can write, see ( 2.15| ), 



(A7) 



where 



A = 



is its own inverse, apart from a factor of 4: 




(A8) 



(A9) 



The other invariant subspaces of L correspond to jk of the form Oq, qO, q'q" , q"q' in the notation of ( 2.1Cl| ). 
Hence, thought of as a 16 x 16 matrix, L is block diagonal with four 4x4 matrices as di agon al blocks. One 
of these matrices is (A8), and the others can be written down explicitly using (A4) to (A6). In each case 
the square of the 4x4 matrix is 4 times the identity. Consequently, is its own inverse, or 



L{jk; Im) L(lm; rs) = 4:djrSks 



(AlO) 



APPENDIX B: SOME PROPERTIES OF Tr{\A\) 
The following results hold for a finite-dimensional Hilbert space H. 

Theorem 1. Let B and C be any two positive (Hermitian with non- negative eigenvalues) operators, and 
let 

A = B-C. (Bl) 
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Then 



Tr(|A|) < Tr(B) +Tr(C). 
The proof consists in noting that if is an orthonormal basis which diagonahzes A, 

A ^^aj\aj){aj\, 



(B2) 
(B3) 



then it follows from (Bl) that 



\a,\ = \{a,\A\a,)\ < \{a,\B\a,)\ + |(a,|C|a,)| 
= + (aj\C\aj), 



(B4) 



where the final equality is a co nse quence of the fact that diagonal matrix elements of a positive operator 
cannot be negative. Summing ( B4 ) over j yields ( p32| ) 
Theorem 2. Let A be a Herniitian operator, and let 



k 

be a decomposition of the identity as a sum of mutually orthogonal projectors. Then 

5]Tr(|F,APfc|)<Tr(|A|). 

k 

In particular, if {l/Jfc)} is any orthonormal basis of Ti, 

J2m\Am<Tr{\A\), 



since we can set Pk = \Pk){f3k\ in (B6). 
To prove this theorem, write 



A = A'- A", 



(B5) 



(B6) 



(B7) 



(B8) 



with A' that part of the sum (B3) with aj > 0, and —A" the remainder. Then A' and A" are positive 
operators, and Tr(|j4|) is the sum of their traces. Define 



Ak = PkAPk, A'k = PkA'Pk, A'l = PkA"Pk, 

and apply theorem 1 to Ak = A'j^ — A'^, to obtain 

Tr(|Afe|) < Tr{A'k) + Tr^O = Tr(A'Pfc) + Tr(A"Pfc), 

where the equality uses F| = Pk. Summing (^ ) over k, see (p5|), yields (p^). 
Theorem 3. Let A be a Hermitian operator onH — T ^ G, and 



Then 



The proof consists in writing 



A^^TvgiA). 

< Trw(|A|). 



where A'yr and A'^ are the partial traces over Q of A' and A" in 
theorem 1 tells us that 

Tr^(|A^|)<Tr^(A^)+Tr^(A^) 
^Tvn{A')+TvHiA") = TTHi\A\). 



(B9) 
(BIO) 

(Bll) 
(B12) 



(B13) 

As these are positive operators, 

(B14) 
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APPENDIX C: DIFFERENTIAL OF THE OPTIMIZATION MAP 



The differential relationship (6.13) can be obtained using the expressions 



from (4.9) and (i.l7), together with, see (4 



&o = CO = /3o' + Pi + Pi + 1% = 1. 



(CI) 
(C2) 



(C3) 



Here and in what follows, q, g', and g" are related using the convention in ( ^.10 ). Let / be some function of 
Po, Pi, f32, Ps on the manifold (|C^). Its differential can be written as 



(C4) 



3 3 

df = Y.{df/dp,)dp, = Po' E(^«/) dP,^ 

j=0 9=1 



where dPo has been eliminated on the right side by setting the differential of (C3) equal to zero, and 

A,/ := Podf/dpg - pgdf/dPo. (C5) 



Using (|C1|), (|C^) and (|C4|), one obtains 



Po db = 4R ■ d(3, 
Padc = S- df3, 

with d(3 — {dPi, dP2, dPs), and the components of the 3x3 matrices R and S are given by: 

Rqq 0, Rq' q Rq" q Pq, 

Sqq — 2(/3q — /3g), Sqqi — Sq'q — Cq" . 

The inverse of R is the matrix Q, with components 

Qqq — Pq' Pq" J Qqq' — Qqq" — Pq' Pq" , 

in the sense that 

Q • R = R - Q = -2PiI, 



(C6) 
(C7) 



(C8) 
(C9) 



(CIO) 



(Cll) 



where Pi — P0P1P2P3, an d I i s the identity matrix Mu ltiplying ( |Cq ) by Q, solving for df3, and inserting the 
result in (C7) results in ( 6.13| ), with J defi ned in ( 6.15| ), after some straightforward algebra. A symmetrical 
argument with 7 in place of P yields (6.14). 



APPENDIX D: MISCELLANEOUS RESULTS FOR SEC. VI 



That at most one of the hq, q = 1,2, 3, can be negative, follows from the condition 

< /3g < /3o for g = 1,2,3, 



(Dl) 



which is implied by ( 6.12| ), since if the jj are non- negative, and one writes out the components of /? in terms 
of those of 7, (4.14), it is obvious that Pq is at least as large as any of the Pq for q > 0. The implications of 
(Dl) can be worked out by assuming, for convenience, the order 

< /3i < /?2 < /33 < Po- (D2) 

Then it is obvious that 

PoP3>P2Pl, PoP2>P3Pl- (D3) 
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so that, see ( 3.17 ), /12 and ^13 are non-negative, while hi might be negative. Similar conclusions follow for 
other orderings. 

The proof of the theorem in Sec. VI for the case ry = A-^ can be constructed using the explicit expressions, 
see @, 

?7o = ^0 +6 + 6 +C3, 
?7i = <^o + Ci - 6 - £.3, 

'72 = -^0 - Cl + 6 - £.3, 

?73 =6-6-6+6- (D4) 



From these it follows that 



q' 



(D5) 



in the notational convention of ( 2.10 ), and this implies ( 6.22 ), which shows that 77 has the same order as 6 
Next we show that if ^ belongs to class P, that is, its components s atis fy the inequalities (6.21), the same is 
true of 77. Because the 6 are non- negative, it follows at once from (D4) that 



Next, a consequence of (D4) is 



VaVq - Vq'Vq" = 4(66 ^ 6'6")> 



(D6) 



(D7) 



so that if the components of ^ satisfy the second inequality in (6.21), so do those of rj. All that remains is 
to show that 



Vq>0- 

If we choose the order of 6, for convenience, to be 

< 6 < 6 < 6 <6, 



(D8) 



(D9) 



then r]i has the same order, and it suffices to show that ryi > 0. Let us assume ryi < 0, which by (D4) implies 

6-6 < 6 -6- (Dio) 

By (p9|), 6 is either zero or positive. If ^2 = 0, then (D9) implies that ^1 = 0, and ( D10| ) yields 6 < 6j in 
violation of (|D|). If 6 > 0, then (^ and (pTcj) tell us that 



6(6-6) < 6(6 -6) 



(Dll) 



or 66 < 661 which is to say, the components of ^ do not satisfy the second inequality in ( |6.21 ). This 
completes the proof. 
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